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ABSTRACT 

We analyse the impact of galaxy-halo misalignment on the ability of weak lensing studies to 
constrain the shape of dark matter haloes, using a combination of the Millennium dark matter 
A-body simulation and different semi-analytic galaxy formation models, as well as simpler 
Monte Carlo tests. Since the distribution of galaxy-halo alignments is not known in detail, we 
test various alignment models, together with different methods of determining the halo shape. 
In addition to alignment, we examine the interplay of halo mass and shape, and galaxy colour 
and morphology with the resulting stacked projected halo shape. We find that only in the 
case where significant numbers of galaxy and halo minor axes are parallel does the stacked, 
projected halo axis ratio fall below 0.95. When using broader misalignment distributions, 
such as those found in recent simulations of galaxy formation, the halo ellipticity signal is 
washed out and would be extremely difficult to measure observationally. It is important to note 
that the spread in stacked halo axis ratio due to theoretical unknowns (differences between 
semi-analytic models, and between alignment models) are much bigger than any statistical 
uncertainty: It is naive to assume that, simply because ACDM predicts aspherical haloes, the 
stacked projected shape will be elliptical. In fact, there is no robust ACDM prediction yet for 
this procedure, and the interpretation of any such elliptical halo signal from lensing in terms 
of physical halo properties will be extremely difficult. 
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1 INTRODUCTION 

Dark matter haloes are irregularly-shaped virialised clumps of col- 
lisionless matter. In the simplest model of the Universe that is most 
compatible with current observations (ACDM), dark matter dom- 
inates the mass budget and haloes form from the collapse and hi- 
erarchical merging of matter in overdense regions. Galaxies form 
from gas that originally clustered with the dark matter, following 
baryonic processes (e.g. radiative cooling, star formation, etc) that 
lead to structures with very different properties and behaviour to 
the nearly-self-similar dark matter haloes. 

Theoretical properties of dark matter haloes are now very 
well established, following decades of research using W-body 
simulations and advances in computing power. This work includes 
characterising the distributions and time-dependence of various 
properties, and correlations between them. The physical properties 
include spin (angular momentum), shape, density profile and 
concentration, phase-space density profile, clustering, and the rela- 
tionship to structures on larger and smaller scales. Recent examples 
of such studies using large-scale cosmological simulations include 
Shaw et al. (2006); Altay, Colberg & Croft (2006); Allgood et al. 
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(2006) ; Harker et al. (2006); Hayashi, Navarro & Springel (2007); 
Hahnetal. (2007b,a); Bettetal. (2007, 2010); Maccio et al. 

(2007) ; Maccio, Dutton & van den Bosch (2008); Neto et al. 
(2007); Gao et al. (2008); Zhang et al. (2009); Davis & Natarajan 
(2009); Boylan-Kolchin et al. (2010); Mufioz-Cuartas et al. 
(2011); Wangetal. (2011); Ludlow et al. (2011); Prada et al. 
(2011) and the recent novel studies using principal component 
analysis (Skibba & Maccio 2011; Jeeson-Daniel et al. 2011). Halo 
properties were recently reviewed in Taylor (2011). Of particular 
interest in this paper are dark matter halo shapes, which are known 
to have a broad distribution, with a preference for prolateness. 

Since "dark" matter is by definition transparent, it is very hard 
to measure these properties directly using standard direct astronom- 
ical observations. Methods that utilise gravitational lensing how- 
ever are sensitive to the entire mass distribution, not just the ra- 
diating baryonic component. This has lead to gravitational lens- 
ing being proposed as a key technique for studying halo proper- 
ties observationally (see e.g. the reviews of Hoekstra & Jain 2008; 
Massey, Kitching & Richard 2010; Huterer 2010). 

Early work on measuring halo mass distributions using weak 
galaxy-galaxy lensing was performed by Kaiser & Squires (1993); 
Wilson, Cole & Frenk (1996a,b); Schneider & Bartelmann (1997); 
Schneider & Rix (1997). Following these, Natarajan & Refregier 
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(2000) proposed a technique for using weak gravitational lensing to 
measure the ellipticity of haloes (see also Brainerd & Wright 2000, 
2002). Consider the shear signal from weak lensing of background 
('source') galaxies, due to the mass associated with a foreground 
('lens') galaxy. In practice, this galaxy-galaxy lensing shear sig- 
nal will be far too weak to be detectable from single lens galax- 
ies, so the signal from many lens systems needs to be stacked. If 
we compare the tangential shear either side of the lens galaxy im- 
age's minor axis with that either side of its major axis, then we can 
obtain a measurement of the flattening of the surrounding matter 
distribution. However, if the ellipticities of halos and galaxies are 
not consistently aligned, the stacking procedure will result in this 
anisotropic shear signal being washed out. 

This method was first used by Hoekstra, Yee & Gladders 
(2004), on data from the Red-sequence Cluster Survey. Assuming a 
model in which the lensing halo and galaxy ellipticities are related 
through e Mo = /e ga i, they found a best-fit value of / = 0.77+" ^ 
(68% confidence level), and claimed to exclude the possibility of 
spherical haloes (f = 0) at 99.5% confidence. Parker et al. (2007), 
using the CFHT Legacy Survey, measured the ratio of the tangen- 
tial shears to be 0.76± 0. 10, excluding spherical haloes at ~ 2tr and 
implying (via Brainerd & Wright 2000) a halo ellipticity of ~ 0.3. 
They also attempted to select mostly elliptical galaxies, which re- 
sulted in a more significant detection of ellipticity. 

Mandelbaum et al. (2006) performed a very thorough analy- 
sis using data from the Sloan Digital Sky Survey (SDSS), which 
included photometric redshifts and galaxy colours (not available 
to the other two studies). However, they did not manage to defini- 
tively detect halo ellipticity, although they found a hint at different 
alignments for different galaxy types. Their work showed how sen- 
sitive the results are to the models used for interpretation: If they 
assumed Gaussian errors with a power law density profile (as in 
Hoekstra, Yee & Gladders 2004), they found / = 0.1 ± 0.06 and 
/ = -0.8 ± 0.4 for red and blue galaxies resprectively; if they in- 
stead assumed non-Gaussian errors and a Navarro, Frenk & White 
(1997) density profile, they instead found / = 0.60±0.38 (reds) and 
/ = —1.4*20 (blues), where negative numbers mean anti-alignment 
of mass and light. 

Unambiguous detection of dark matter halo ellipticity has 
been seen as an important goal, because it offers the prospect of 
falsifying alternative theories of gravity, such as MOND/TeVeS 
(Milgrom 1983; Bekenstein 2004) or MOG/STVG (Moffat 2006; 
Moffat & Toth 2009a). Such theories suffer from being more the- 
oretically and computationally challenging compared to simple 
collisionless matter in Newtonian gravity, which has resulted in 
their theoretical predictions being less developed at present. Nev- 
ertheless, the formalism for gravitational lensing has been devel- 
oped both for TeVeS (Bekenstein 2004; Chiu, Ko & Tian 2006) 
and recently for STVG (Moffat & Toth 2009c). Predictions of lens- 
ing from MOND actually predate the relativistic description from 
TeVeS (Mortlock & Turner 2001), and predictions for the equiva- 
lent counterpart of "halo" shapes in MOND was given in Milgrom 

(2001) and Sellwood & Kosowsky (2002). A robust prediction 
from TeVeS/MOND is that the lensing signal away from the lens 
galaxy should be isotropic. Thus any detection of ellipticity - re- 
gardless of whether it agrees with the predictions from ACDM sim- 
ulations - would falsify TeVeS. However, this result will only be 
strictly true for a well-isolated lens galaxy, which is harder to estab- 
lish in practice. The presence of mass from nearby galaxies can pro- 
duce effects which go against our intuitive understanding of gravity, 
e.g. STVG violates Birkhoff 's theorem (Moffat & Toth 2009b), and 
can appear to fit the Bullet Cluster (Brownstein & Moffat 2007) (al- 



though in that context, Springel & Farrar 2007 showed that neglect- 
ing the hydrodynamics of the baryons is also greatly misleading). 
Thus, interpreting the results of anisotropic shear measurements, 
whether circular or elliptical, should be done with caution. 

The problem of galaxy-halo alignment is central to this 
work. There is, essentially, no robust prediction of the relative 
orientation of galaxies within their haloes from theory or simu- 
lation. This is not to say that is has not been measured, but that 
the physical processes involved vary significantly from simulation 
to simulation, and the number of objects studied is often still 
small (< 10 2 ) compared to the large statistical samples used in 
observations and dark matter simulations (> 10 6 ). However, all 
simulation work has been consistent in predicting a reasonably 
broad distribution of galaxy-halo alignments, albeit with variation 
in the median angle. These include van den Bosch et al. (2002), 
van den Bosch, Abel & Hernquist (2003), Yoshidaetal. (2003), 
Chen, Jing & Yoshikawa (2003), Sharma & Steinmetz (2005), 
Bailin et al. (2005), Gustafsson, Fairbairn & Sommer-Larsen 
(2006), Croft et al. (2009), Romano-Diaz et al. (2009), Bett et al. 
(2010), Hahn, Teyssier & Carollo (2010), and Deason et al. (201 1). 

The qualitative impact of galaxy-halo misalignment on 
the method of Natarajan & Refregier (2000) is intuitive and 
well-known, but it has not been considered quantitatively. On 
the other hand, different models of galaxy-halo alignment 
have been used for studies of the intrinsic alignment problem 
in galaxy-galaxy lensing (Heavens, Refregier & Heymans 2000; 
Heymans et al. 2004, 2006), and for modelling the satellite galaxy 
distribution when considering cluster lensing (Okumura, Jing & Li 
2009; Okumura & Jing 2009). Furthermore, variation between the 
predicitons of different galaxy formation simulations and models, 
and even from different methods of measuring shapes of simu- 
lated haloes, are rarely considered when observations are compared 
to "the" theoretical prediction. The complex systematic problems 
that can affect observations and prevent straightforward interpreta- 
tion, are however very well studied (e.g. Brainerd & Wright 2000; 
Hoekstra, Yee & Gladders 2004; Mandelbaum et al. 2005, 2006; 
Howell & Brainerd 2010). 

In this paper, we focus therefore on quantifying the impact of 
galaxy-halo misalignment on stacked projected halo shapes, using 
a range of different models for galaxies, halo shapes and align- 
ment distributions to highlight the uncertainty in the theoretical 
prediction. We do not proceed to make a direct anisotropic shear 
predicion from our results, as this is already well studied (e.g. 
Howell & Brainerd 2010), and will only serve to reduce any ellip- 
ticity signal. 

This paper is organised as follows. In section 2, we describe 
in detail the simulation and series of models we use. This includes 
the simulation and galaxy formation models (section 2. 1 ), different 
methods of measuring halo shapes from simulations (§2.2), and the 
different alignment models we consider (§2.3). Section 3 describes 
simple Monte Carlo tests of the impact of our alignment models 
on distributions of halo shapes. We present our results in section 4, 
as series of axis ratios generated by stacking samples of projected 
halo shapes, showing how they depend on the distributions of halo 
and galaxy properties. We discuss our conclusions in section 5. 
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2 MODELLING THE IMPACT OF MISALIGNMENT 
2.1 The simulation 

We use the original 1 Millennium Simulation (MS, Springel et al. 
2005), a very large A'-body cosmological dark matter simulation of 
the large-scale structure of a ACDM universe. This simulation re- 
solves many millions of objects at each timestep, providing the sta- 
tistical power for decribing distributions of dark matter halo prop- 
erties very percisely. The simulation is in a periodic box of length 
500/i~'Mpc, populated with over 10 billion collisionless dark mat- 
ter particles (2160 3 ), each of mass m p = 8.60657 x 10 8 /r'Mo 
and a gravitational softening length of 5.0A~'kpc. The simulation 
code used was L-Gadget-2, a version of the Tree-PM code Gadget- 
2 (Springel 2005) that was specially optimised for massively paral- 
lel computations and low memory consumption. 

The MS uses a set of cosmological parameters that were cho- 
sen to be consistent with the results of the 2dFGRS (Percival et al. 
2002) and WMAP-1 (Spergel et al. 2003). We write cosmologi- 
cal density parameters as Q.j(z) = pd.z) I pdtiz), in terms of the 
mass density 2 of component i and the critical density p c ,it(z) = 
3H(z) 2 /(8nG), where the Hubble parameter is H(z). For the cos- 
mological constant, total mass, and baryonic mass, the MS uses 
values of n A . = fi A (z = 0) = 0.75, C2 M .o = 0.25, and £\ = 0.045. 
The present-day value of the Hubble parameter is parameterised in 
the standard way as H = 100/j km s _I Mpc~' , where h = 0.73. The 
spectral index is n = 1.0 and the linear-theory mass variance in 
8 ft^'Mpc spheres at z = is given by erg = 0.9. 

2.1.1 Semi-analytic models 

Various halo and galaxy catalogues from the MS have 
been made publicly available through an online database 3 
(Lemson & the Virgo Consortium 2006). They are based on two 
independent semi-analytic code development programmes, that of 
the ICC in Durham (based on the Galform model), and the MPA 
in Garching. While these models (and those of other groups) have 
been very successful in many regards, no model has yet matched 
the full distribution of galaxy properties at all luminosities, colours 
and redshifts simultaneously. The galaxy catalogues we shall use 
are the ICC model of Bower et al. (2006) (hereafter B06), and 
the MPA model of De Lucia & Blaizot (2007) (hereafter DLB07). 
Both models are based on previous codes, incorporating new 
features, and retaining/improving others. The B06 model builds 
on the previous models of Cole et al. (2000) and Benson et al. 
(2003), whereas the DLB07 model is based on the previous 
work of Kauffmann & Haehnelt (2000), Springel et al. (2001), 
De Lucia, Kauffmann & White (2004), Springel et al. (2005), 
Croton et al. (2006), and De Lucia et al. (2006). Note that further 
models have been produced in subsequent work by both groups. 
We have chosen to use the B06 and DLB07 models because these 
versions have been very widely used, and have already been subject 
to detailed model comparison work. Parry, Eke & Frenk (2009) 
recently performed a detailed study of the different morphological 



A second Millennium Simulation (MS-II) has since been per- 
formed, using the same number of particles in a smaller volume; see 
Boylan-Kolchin et al. (2009) for details. 

2 One can write the equivalent mass-density of the cosmological constant 
A as pa — Ac 2 /(8;rG). 

3 http: //gavo .mpa- garching .mpg . de/MyMillennium3/ and 
http: //galaxy- catalogue, dur .ac .uk: 8088/MyMillennium/ 



mixes predicted by these two models, and discussed the model 
differences that lead to these variations. Further model-comparison 
work was carried out in De Lucia et al. (2010), concentrating on 
the implementations of mergers and gas cooling. 

While both models have developed from essentially the same 
principles (e.g. White & Frenk 1991; see also the review of Baugh 
2006), and attempt to model the same processes, significant differ- 
ences nevertheless exist in the details of the modelling. Different 
methods are used for calculating the gas cooling rates, and they 
use different stellar initial mass functions and models for attenua- 
tion by dust. Both models use the same stellar population synthesis 
model, and implement feedback from stellar winds and supernovae, 
injecting energy back into the gas. Galaxy mergers (distinct from 
halo mergers) and disc instabilities are treated differently in the two 
models, with different triggers for starbursts. Both models also im- 
plement the growth of black holes and feedback from AGN in very 
different ways. Finally, the models also differ in the halo definition 
used, the merger tree algorithm, and the way in which galaxy cal- 
culations are linked to the merger trees. We refer the reader to the 
papers referenced above for full details of the models. 

2.1.2 Halo Identification 

We define our haloes from the simulation particles using a multi- 
stage process, incorporating information about spatial clustering, 
binding energy, and substructure dynamics. This is the so-called 
"merger-tree halo" definition originally described in Harker et al. 
(2006), to which we refer the reader for a full description. We sum- 
marise the main points here. 

The procedure starts with the simulation particles grouped 
by proximity, using the well-known Friends-of-Friends algorithm 
(FOF, e.g. Davis et al. 1985), with a linking length of b = 0.2 times 
the mean interparticle separation (e.g. Porciani, Dekel & Hoffman 
2002). Within each FOF group, self-bound substrucures are iden- 
tified using the Subfind program (Springel et al. 2001). This is it- 
self a two-stage process, first identifying candidate substructures 
by finding peaks in the density field, then performing an iterative 
unbinding procedure, sucessively removing particles not gravita- 
tionally bound to the candidate (a minimum mass of 20 particles 
is imposed for substructures). This results in a set of FOF particle 
groups, each comprising some unbound particles ("fuzz") plus zero 
or more self-bound structures, usually divided conceptually into 
the main body of the halo (the most-massive substructure, MMSS), 
plus subhaloes. 

Using the FOF/Subfind catalogues from different output snap- 
shots in the simulation, merger trees are constructed, identifying 
structures in one snapshot with their progenitors and descendents 
in other snapshots (Harker et al. 2006). Our haloes are defined us- 
ing information from the merger trees as a final stage of refinement. 
Firstly, the fuzz particles are excluded, leaving the basic halo as the 
set of bound structures originating from the same FOF group. Then 
subhaloes are subjected to a splitting algorithm, allowing them to 
be separated off from the original halo. This attempts to identify 
subhaloes that are spatially but not dynamically linked to the halo. 
For example, a subhalo might have been linked into a FOF group 
solely by fuzz particles (now excluded), or it could simply be flying 
past the main halo without yet becoming bound to it. 

This halo definition, and the merger trees themselves, were 
originally designed for use with the Galform semi-analytic model 
(following Helly et al. 2003), and its application to the MS in the 
B06 model. Bett et al. (2007) studied the effect of different halo 
definition algorithms, comparing haloes from this method with 
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those from simply using FOF without refinement, and those de- 
fined by an overdensity criterion to give a spherical halo boundry 
at the virial radius. In addition to a visual comparison (in real and 
velocity space), they also compared halo spin, shapes and cluster- 
ing. In terms of halo shapes, haloes defined by a spherical boundry 
were (unsurprisingly) biased towards spherical, and the simple FOF 
haloes had a much broader distribution of shapes than the merger 
tree haloes. It should be noted that, while further testing of differ- 
ent halo-finding algorithms is beyond the scope of this paper, the 
choice of algorithm will affect the results and should be borne in 
mind when interpreting results here and in other studies. 

The DLB07 model uses a slightly different halo definition, 
omitting the splitting procedure outlined above. This means that, 
from the point of view of the DLB07 galaxies, a halo consists of all 
the bound structures associated with the parent FOF halo. These 
halo catalogues therefore have slightly fewer objects than the halo 
catalogues we use here (and were used in B06). However, since 
both halo catalogues are built up from the same set of Subfind struc- 
tures, it is straightforward to identify galaxies from both models 
that are associated with the same corresponding dark matter struc- 
ture. 

2.1.3 Selecting halo-galaxy systems 

In this paper, we are interested in the possibility of measuring the 
shapes of sub-cluster-mass haloes observationally. Therefore, we 
should attempt to use observational selection methods when pick- 
ing objects for study from the raw halo catalogues. At the same 
time, it is important when working with W-body simulations to de- 
fine and select objects for study carefully to guard against biases 
due to numerical effects. 

A technique commonly-used when selecting haloes from sim- 
ulations is to attempt to exclude unvirialised systems. Bett et al. 
(2007) applied a cut on the halo energy ratio (to select haloes in 
'quasi-equilibrium', as an approximation to virialisation), only ac- 
cepting haloes with |1 + 2T/U\ < 0.5, where T is the kinetic and 
U the potential energy. While Bett et al. (2007) studied halo shapes 
in the MS, a large part of that paper was focused on the halo spin 
parameter A. As originally defined (Peebles 1969, 1971), A is re- 
ally only valid for isolated, virialised haloes, so this cut played 
an important role in excluding invalid objects. More generally, a 
virialisation-based cut can help exclude haloes that are poorly de- 
fined, for example those that are currently undergoing a merger. 
In this case, the boundaries of the halo itself, and thus most of its 
other properties, are also poorly defined. However, aspects of our 
halo definition - excluding unbound particles, and splitting off dy- 
namically separate subhaloes - go a long way towards solving these 
problems, such that the explicit cut in 1 1 +2T/U\ only effects a rel- 
atively small minority of haloes (see Bett et al. 2007). Since such 
a cut would be very difficult to apply accurately in observational 
data, we choose to not apply it here. 

Another important cut usually applied to simulations is on the 
minimum number of particles for a halo, to ensure that haloes are 
well-resolved. Bett et al. (2007) showed that the shapes of haloes in 
the MS realised with fewer than ~ 10 3 particles were biased away 
from spherical towards prolateness. We do not automatically apply 
this cut, but we will test its impact on our results. This is related, in 
principle, to the cut in galaxy magnitude we describe below. 

The physical processes experienced by galaxies in clusters are 
different to those of galaxies in lower-mass haloes. Furthermore, 
the observational techniques used to study them are also different; 
the method we are concerned with here does not apply in the same 



way. We therefore exclude galaxy clusters, by applying an upper 
mass cut of M < IO^/t'Mq (in practice, a particle-number cut 
atiVp < 11 619). This cut is also difficult to perform observation- 
ally, but it could be approximated by, for example, excluding the 
brightest galaxies (presuming that they are BCGs), or excluding re- 
gions where the galaxy number density is high. Some methods are 
detailed in Brainerd (2005). 

We are interested in the shape of haloes of individual galaxies. 
We therefore need to maintain a 1:1 relationship between galax- 
ies and haloes: this means excluding satellite galaxies and sub- 
haloes, and restricting ourselves to central galaxies only. (While the 
distinction between haloes and subhaloes is very important when 
analysing data from simulations, it is admittedly much harder to as- 
certain observationally.) Since the halo definition algorithm we use 
corresponds to that used in the B06 model, we shall base our analy- 
sis on that catalogue. This means that we can simply select galaxies 
from B06 that have been tagged as 'centrals' in the database (i.e. 
using Type = in the B06 database table). 

Galaxies from the DLB07 catalogue are selected by identify- 
ing the galaxies belonging to the same Subfind structures as the cor- 
responding B06 galaxies. In most cases, these will also be central 
galaxies (as the MMSS of a B06 halo is likely to be the MMSS of 
a DLB07 halo). However, sometimes a halo identified in the B06 
model (and our halo catalogues) will be considered to be a sub- 
halo in the DLB07 model. This means that the associated galaxy 
could have evolved significantly differently to its B06 counterpart, 
as (in both models) central and satellite galaxies are treated differ- 
ently. Nevertheless, each galaxy will still be the central galaxy of 
the same mass structure. While this means that our DLB07 galaxy 
sample is not the same as just selecting Type = galaxies in the 
DLB07 database table, it should not have a very strong impact on 
the main results of this paper. Indeed, since this is effectively in- 
corporating the theoretical uncertainty in determining satellites and 
central galaxies, it helps to overcome some of the artificialness of 
the Type = selection, and mimic to some degree the difficulty in 
distinguishing centrals/satellites observationally. While a full study 
on the systematic differences between Type = galaxies and field 
galaxies in general is beyond the scope of this paper, it is important 
to note that the halo mass function means that most galaxies (> 80 
per cent) of a given brightness are centrals: Lower-mass haloes that 
can host galaxies of a given magnitude as centrals are always much 
more abundant than higher-mass haloes able to host them as satel- 
lites (although such galaxies are likely to differ systematically in 
other properties). Hence, the Type = cut retains most galaxies 
of each magnitude. 

In an effort to match observational samples as much as pos- 
sible, we select ('lens') galaxies using a cut in apparent observer- 
frame r-band magnitude. While similar studies with the SDSS have 
selected galaxies with r < 19 (Mandelbaum et al. 2006), we take 
our limit from the upcoming KIDS survey, and use r < 24.3 (e.g. 
Kuijken 2006, 2010). This is admittedly a rather optimistic limit, 
but we want to avoid handicapping our data unnecessarily. 

We perform our analysis on the redshift z = 0.32 data from 
the MS (output snapshot 52), again based on the expected median 
redshift of gravitational lenses in KIDS. At this redshift, our limit- 
ing apparent magnitude of r llm = 24.3 translates into an (observer- 
frame) absolute magnitude of M,. - 5 log 10 h = r lim - 5 log 10 D(z) + 
5 = -16.1, where D(z) is the luminosity distance in parsecs. We 
show the joint distributions of galaxy magnitude and halo mass for 
our two semi-analytic models in Fig. 1. 

While, in reality, objects would be observed over a broad range 
of redshifts, it is more straightforward to use just a single snapshot 
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Figure 1. The joint distributions of halo mass with (observer-frame) r-band 
apparent magnitude, from the B06 model (top) and the DLB07 model (bot- 
tom). Dashed vertical lines show the masses corresponding to 1000 particles 
and 11619 particles, the latter being the upper mass limit we will use in our 
analysis (10 13 /r'M ). 

for our purposes. This gives us many millions of objects already, so 
we do not need to use other snapshots to improve our sample size. 
We do perform our analysis at other discrete redshifts however, and 
show these results in Appendix C. 

In addition to galaxy magnitude, we shall also be looking at 
morphology and colour as ways of selecting objects to improve 
halo shape measurements. For morphology, we use the stellar-mass 
bulge-to-total ratio B/T. We classify galaxies according to whether 
they are bulge-dominated or disc-dominated: In particular, for con- 
venience we describe those with B/T > 0.5 as "ellipticals", and 
those with B/T < 0.5 as "discs". The B/T distribution from both 
semi-analytic models is strongly bimodal: there is a very strong 
peak for discs at B/T < 0.005, a much smaller but similarly narrow 
peak for ellipticals at B/T ^ 0.995. There is also a significant but 
low-population set of intermediate-morphology galaxies, covering 
34% of selected galaxies at z — 0.3 in the B06 model, and 27% 
for the DLB07 model. Parry, Eke & Frenk (2009) split the galaxy 
populations from these models into three samples (B/T < 0.4, 
0.4 < B/T < 0.6 and B/T > 0.6), but for our purposes simply 
splitting into two samples at B/T = 0.5 is sufficient. 

We have tested three different measures of galaxy morphol- 



ogy in the B06 model: by stellar mass, by r-band magnitude, and 
by g-band magnitude. While the latter two correlate well with each 
other, they can scatter somewhat when compared to morphologies 
determined by stellar mass, with more galaxies appearing to have 
more intermediate morphologies when determined by magnitude. 
However, dividing our galaxy population simply into two broad 
morphological categories means that the vast majority of galaxies 
fall into the same category regardless of the measure used. 

Following Strateva et al. (2001) (see also Baldry et al. 2004), 
Mandelbaum et al. (2006) make the division between "red" and 
"blue" galaxies at SDSS rest-frame u — r = 2.22. The semi-analytic 
models do not reproduce the observed colour distribution, although 
the colours are easily divided into red and blue samples. Empiri- 
cally, we find that in the B06 model, we need to place that cut at 
rest-frame u- r = 0.9. For the DLB07 model, only the observer- 
frame magnitudes are available, meaning we cannot directly com- 
pare galaxy colours at different redshifts since a A"-correction has 
not been applied. However, examination of the colour distributions 
at z — 0.32 suggests an empirical colour-cut of observer-frame 
u- r = 3.5. We show the colour distributions at different redshifts 
in Fig. 2. 

It is important to note that galaxy colour and morphology are 
distinct, albeit related properties - see for example the comparison 
of galaxy morphologies and colour in the SDSS in Benson et al. 
(2007). We will discuss how the distributions of colour and mor- 
phology are related for the galaxy catalogues we use, in the Results 
section. 

2.2 Shapes of dark matter haloes 

Dark matter haloes are irregularly-shaped clumps of material, de- 
fined in principle by an envelope that demarcates either a given 
mass-density threshold, or - if one is more concerned with dynamic 
properties, such as virialisation - a threshold in phase-space den- 
sity. In practice however, for ease of both definition and compar- 
ison between haloes, the shape of a halo is usually characterised 
by the ellipsoid defined by the eigenvectors and eigenvalues of a 
matrix describing the halo's internal mass distribution. There are 
many ways to measure the mass distribution however, and differ- 
ent authors measure halo shapes from simulations using different 
matrices. Each of these methods have their own advantages and 
disadvantages, and not all are so relevant for observational stud- 
ies. Observations do need to be compared with theoretical predic- 
tions from simulations however, and this can be complicated by the 
variation due to the range of methods used by theorists. Therefore, 
while an in-depth comparison of different methods of measuring 
halo shapes is outside the scope of this paper, we nevertheless elect 
to test four different shape tensors rather than picking just one, to 
highlight the variation in the theoretical predictions. We compare 
them briefly at the end of this subsection. 

We describe the four halo shape algorithms we use below. 
Throughout, the tensor/vector components have indices = 
(1,2,3), and the halo has A' particles indexed by p. The particles 
have positions r p = (r Pt i,r p ^, r P 3) T with respect to the halo centre. 

2.2.1 The simple inertia tensor 

The inertia tensor I directly relates angular momentum / and angu- 
lar velocity a> through / = \oj, and has components 
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u-r (rest frame) u - r (observer frame) it - r (observer frame) 

Figure 2. The distributions of galaxy colours in the two semi-analytic galaxy models, at the redshift we use for our main analysis (black), plus the additional 
redshifts used in Appendix C (red and blue). Left: Rest-frame colours from B06 with the colour-cut at u - r = 0.9 shown as a dashed line. Middle: Observer- 
frame colours from the B06 model. Arrows indicate colours corresponding to the rest-frame cut, by examining the distribution of observer-frame colours for 
rest-frame 0.899 < u — r < 0.901. Upper arrows mark the medians (1.19 at z = 0.17, 1.40 at z = 0.32, and 1.44 at z = 0.50), and lower arrows mark the modal 
colours (1.40 at z, = 0.17, 1.82 atz = 0.32, and 2.07 at z = 0.50). Right: Same for the DLB07 model, with the empirical colour-cuts marked (2.77 for z = 0.17, 
3.50 for z. = 0.32, and 3.66 for z = 0.50). 



where Sy is the Kronecker delta. Choosing a coordinate frame in 
which I is diagonal (i.e. the eigenframe) is equivalent to finding the 
preferred axes of rotation of the object, i.e. the set of axes in which 
a torque about one does not induce a rotation about another. The 
axis directions are given by the eigenvectors, and the eigenvalues 
are the moments of inertia. The axis lengths a > b > c are given 
by the square root of linear combinations of the moments of inertia 
per unit mass (e.g. Bett et al. 2007). These principal axes define the 
equivalent homogeneous ellipsoid that has the same moments of 
inertia - i.e. the same behaviour under rotations - as the halo itself. 
Axis ratios are usually denoted s = c/a, q = b/a, and p = c/b. 

Unless these relations to / and oj are directly relevant how- 
ever, it is slightly simpler computationally to use the tensor of the 
quadrupole moments of the mass distribution, M, which has com- 
ponents 



I' 



(3) 



(2) 



This is done to remove bias due to, for example, large subhaloes lo- 
cated on the outskirts of the halo (Gerhard 1983). In this "reduced" 
inertia tensor (which we shall refer to as M R(ill ), the particles are 
projected onto a unit sphere, and the shape measured is a descrip- 
tion of the mass distribution in different directions; each particle 
contributes its mass equally. This can provide a better description 
of the "underlying" halo shape rather than just the distribution of 
subhaloes; whether or not one considers the subhaloes to be a dis- 
tinctive aspect of the halo shape or an annoyance that needs to be 
removed depends on the study in question. For observational stud- 
ies, the influence of the subhalo distribution is likely to be an impor- 
tant part of the measurement; furthermore, the weighting would be 
difficult to perform accurately. This method has been used recently 
by Bailin & Steinmetz (2005). 



This has the same eigenvectors as I, and the eigenvalues per unit 
mass give the squares of the ellipsoid axis lengths directly. The 
two tensors are related through Ijj = Tr(M)<5, ; - Mij, and the 
quadrupole tensor is often referred to as the inertia tensor in the 
literature (Binney & Tremaine 2008; Zhang et al. 2009; Bett et al. 
2007, 2010). We shall refer to this as the simple inertia tensor Ms m p 
for brevity. 

This tensor has been very widely used in the literature; other 
recent users include Faltenbacher et al. (2002), Kasun & Evrard 
(2005), Hopkins, Bahcall & Bode (2005), Shaw et al. (2006), 
Altay, Colberg & Croft (2006), Hahn et al. (2007b,a), 
Heller, Shlosman & Athanassoula (2007) Romano-Dfaz et al. 
(2009), and Jeeson-Daniel et al. (2011). 



2.2.2 The reduced inertia tensor 



2.2.3 The iterative simple inertia tensor 

In addition to the two preceding direct methods, iterative methods 
based on the same principles are also often used. The procedure we 
use is the following (e.g. Katz 1991): 

(i) Compute the inertia tensor Ms mp using all the halo's particles, 
yielding initial axis lengths a, b, c. This initial halo has a radius R. 

(ii) Select the particles within the ellipsoid just defined, i.e. only 
the particles for which the elliptical distance satisfies 4 



'p,i 



(4) 



where we use the axis ratios q = b/a and s = c/a. 
(iii) Using this new particle set, recompute M Smp . 



A commonly-used variation on the simple inertia tensor is to coun- 
terweight each particle by its distance from the centre, i.e. use the 
tensor with components 



This corresponds to keeping the major axis a constant. An alternative is 

j 1 !j2 } 3 

to keep the volume constant, using the condition + -^f- + ^ 1 . 

a L b L c L 



WL & Halo shapes 7 



The process is deemed to converge when, after a given iteration k, 



Sk-\ 



< 0.01 and 



9k 



<0.01. 



(5) 



The process is deemed to have failed to converge if it takes more 
than 100 steps, or the shape ellipsoid comprises fewer than 10 par- 
ticles. We denote the resulting tensor after convergence as Ms mp it r . 

This procedure (or close variants of it) is often used in situa- 
tions where the set of particles comprising the object is unknown. 
For example, iterative shape-finding might be used as part of the 
halo-finding algorithm, so that the resulting halo has an ellipsoidal 
boundary that agrees with the measured shape exactly (within a 
given tolerance), rather than the shape ellipsoid being an approxi- 
mation to the real shape of a pre-determined particle set. Another 
important usage case is when ellipsoidal density profiles are re- 
quired, or just the shape profiles themselves. Rather than use spher- 
ical bins in halo radius, one uses the equivalent elliptical radii f, 
and the axis ratios q(f) and s(f) are computed iteratively in each 
bin. 

Neither of these cases are relevant to us here. Our haloes 
are already defined using a more sophisticated method including 
particle proximity (the FOF step), binding energy (the Subfind 
step) and substructure dynamics (the merger tree step), and is the 
same definition used for the B06 galaxy model. (A consequence 
of this is that the halo shapes we measure are relatively crude ap- 
proximations to the actual isodensity surfaces.) Furthermore, us- 
ing shape profiles is beyond the scope of the present paper. How- 
ever, an iterative scheme is still informative, as it may give a re- 
sulting halo shape that is more robust against numerical effects 
like dominance by very few particles. We include it here primar- 
ily to allow comparison between different methods used in the lit- 
erature. This method has recently been used in Paz et al. (2006), 
Maccio etal. (2007); Maccio, Dutton & van den Bosch (2008), 
Munoz-Cuartas et al. (2011), and Lau et al. (2011). 



2.2.4 The iterative reduced inertia tensor 

The iteration scheme described above can be used with a re- 
duced inertia tensor, defined similarly to that in equation (3) 
(Dubinski & Carlberg 1991; Warren et al. 1992): 



v 1 'pJ'p-j 
M 'J = lj m »—p- 



(6) 



where f p is the elliptical distance defined in equation (4). We shall 
refer to the resulting tensor after convergence as M Rdultl . This is, in 
fact, the most common way of using the reduced inertia tensor in 
practice, and has been recently used in Kazantzidis et al. (2004), 
Allgood et al. (2006) and Vera-Ciro et al. (201 1). 



2.2.5 Comparison 

To illustrate the impact that these algorithms make on the halo 
shape measured in simulations, Fig. 3 shows the resulting axis ra- 
tios S = c/a as a function of halo mass (using haloes selected for 
our analysis of the B06 model, i.e. at z — 0.32, and hosting a central 
galaxy from B06 with r < 24.3). The error bars on the medians are 
an estimate of their uncertainty, by analogy with the standard error 
on the mean of a Gaussian: 

Xg4 - Xjo 



where X t is the value at the ith percentile of the distribution in ques- 
tion, made up of N objects (X50 is the median). The error bars only 
become significant at high masses, where there are relatively few 
haloes in each mass bin. 

We find very little difference between the results of the Simple 
and the Simple Iterative shape tensors. The reduced tensor however 
yields significantly more spherical haloes at all masses, confirm- 
ing the idea that much of a halo's asphericity is due to the dis- 
tributions of subhaloes and the outer mass distribution generally. 
The Iterative Reduced shape tensor produces very similar results 
to M Smp , but very slightly more spherical. Previous authors have 
found that, with the advent of simulations able to resolve a signifi- 
cant amount of substructure, iterative methods often failed to con- 
verge (e.g. Jing & Suto 2002; Bailin & Steinmetz 2005; Shaw et al. 
2006). We find that 18.0 and 25.9 per cent of the selected haloes fail 
to converge for the Ms mp it r and MRd u i tr tensors respectively - a sig- 
nificant amount, but a small fraction of the population (of 7 907 290 
haloes). As seen in the figure, their loss does not bias the shape dis- 
tribution significantly. 

Detailed further discussion, and comparison of different 
shape-finding methods, can be found in Jing & Suto (2002), 
Springel, White & Hernquist (2004), Bailin & Steinmetz (2005), 
Vera-Ciro et al. (2011), and the recent paper dedicated to the sub- 
ject by Zemp et al. (201 1). 

It is important to emphasize that, while all of these methods 
are commonly used in the literature to measure halo shapes from 
simulations, they are not all relevant for comparison with obser- 
vational studies. In fact, it is the simple mass quadrupole moment 
tensor Ms mp that is the most directly related to the shear signal from 
weak lensing (e.g. Schneider & Bartelmann 1997). In the sense 
that the Ms mp nr is a more robust description of the same moments 
of inertia as Ms mp , then it is also important to note if or when it 
yields singnificantly different results. However, the "reduced" ten- 
sors Mrju and MRduitr, while providing very important measures of 
the physical halo shape, are much less accessible to observational 
tests by weak lensing. We include them here primarily to illustrate 
the systematic impact they have on the results, to aid comparison 
between observational studies and future theoretical predictions. 



2.3 Modelling the orientation of galaxies 

The orientation of galaxies with respect to their dark matter haloes 
is not tackled in current semi-analytic models of galaxy formation. 
We must therefore model galaxy-halo alignment ourselves. 

We consider the central galaxy within a halo, where the galaxy 
minor axis c ga i is oriented in some direction 9 with respect to 
some characteristic halo vector v n , i.e. c ga rVh = |ch||v„| cos 9. In 
our model, we identify v h with either the halo minor axis c h , or 
the angular momentum /. Note that c h and / themselves have 
an alignment distribution, which is not significantly correlated to 
other halo properties such as shape; see e.g. Bett et al. (2007) and 
Skibba & Maccio (2011). We can define a complete set of basis 
vectors for the halo (x h , y h & z h ), identifying the 'z'-axis direction 
Zh with that of v n . The other axes can be formed by rotations of 
90° from that, following the right-hand rule. If v n points at a po- 
lar angle 9 h and azimuthal angle h (with respect to the simulation 
coordinate system, for example), then we have 



X50 - Xi, 



V7v 



(7) 



Zh 



sin 0„ cos 0„ 
sin 9 h sin (p h 
, cos0 h 
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Figure 3. Halo 3-D axis ratio s = c/a vs mass for the four shape algorithms. Also plotted are the median values of s in mass bins, with error bars given by 
equation (7). Dashed vertical lines show the masses corresponding to 1000 particles and 11619 particles, the latter being the upper mass limit we use in our 
analysis (10 13 h~ 'Mq). 
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Note that, if v h = c h , then the plane spanned by the basis vectors 
x h & y h is parallel to that of the halo axes a h & ft h . However, we 
do not require that e.g Xh and «h etc. are parallel, as our modelling 
of the galaxy orientation is based solely on the direction of v h . The 
orientation of any given halo shape with respect to its Vh (and hence 
ih & y h ) is fixed, and we do not need to specify it explicitly in our 
modelling. 

In the same way as for our halo coordinates, we let 6 and <f> 
describe the polar coordinates giving the orientation of the galaxy 
minor axis c ga i, with respect to this halo reference frame. We choose 
the 6 and <p by randomly sampling from different distributions. As 
we see no convincing physical reason for there being a preferred 
angle for <p, we sample it from a uniform distribution between 
and 2n. However, we test four different models for the galaxy-halo 
alignment angle 9, which we describe below. 

Using these two angles we can define a set of basis vectors for 
the galaxy (Jc ga i, y glll , z ga i) in the same way as equations (8) above. 
However, if we consider the galaxy, like the halo, to be a triaxial 
ellipsoid, then we need a third angle £ to define the orientation of 
a ga i and b glll on the x sa i-j„ al plane. Like <f>, there is no convincing 
reason for there to be a strongly-preferred value of so we again 
randomly sample it from a uniform distribution over 0-2n. It is im- 
portant to note that <f is still significant even in the case of a disc 
galaxy with a ga i = b ga \. This is because we define our "image plane" 



later based on the galaxy's « ga i-c ga i plane, so £ has a strong impact 
on the orientation of the projected halo (see section 2.4). We give 
more mathematical details of the rotations involved in implement- 
ing our orientation model in Appendix B. 

We now go on to describe the four models we use to pro- 
vide distributions of the galaxy-halo alignment angle 6. It should 
be noted that we do not expect that the "true" alignment distribu- 
tion to match any of these models in detail. Rather, our intention is 
that they span the possibilities of galaxy-halo alignment, such that 
the impact of any given model can be easily understood in obser- 
vational terms. 



2.3.1 Parallel 

In this model, we take the characteristic halo vector to be its minor 
axis (v h = Ch), and set the galaxy minor axis to be perfectly aligned 
with it; i.e. the angle between c h and c ga i is 6 = 0. This is the most 
optimistic, 'best-case' scenario for attempts to measure halo shape. 

Note however that even in this case, due to our random sam- 
pling of 0, £ and the inclination of the image plane (see later), the 
ellipticity of the projected shape can vary, and it can be misaligned 
with respect to the galaxy. 



2.3.2 Uniform 

In this case, the orientation of the galaxy with respect to the halo 
is uniformly distributed, i.e. the probability distribution of cos 6 is 
flat over the range [-1,1]. This is the worst case scenario for halo 
shape measurements. 
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2.3.3 Fit to simulations 

In the study of weak lensing with COMBO- 17 data, Heymans et al. 
(2004) used a truncated Gaussian distribution to very 
roughly fit the galaxy-halo alignment from the simulations 
of van den Bosch et al. (2002), which used dark matter and 
non-radiative gas. In more recent years, the probability distribution 
for galaxy-halo alignment has been measured in more advanced 
hydrodynamic simulations, which include radiative cooling, star 
formation and feedback processes. Furthermore, we can fit them 
using functions more suited to the 3-D polar angle that we are 
measuring. 

We model the galaxy-halo alignment based on the spin- 
spin alignment shown in Bett et al. (2010) (their fig. 17) and 
Deason et al. (2011) (the top-right panel in their fig. 3), which are 
based on the simulations of Okamoto et al. (2005) and the Gimic 
simulations (Crain et al. 2009), respectively. We assume that c gii i 
is parallel to the galaxy spin axis (Libeskind et al. 2007; Bett et al. 
2010). Bett et al. (2010) measure the orientation of their galaxies 
with respect to their parent haloes in the galaxy formation sim- 
ulation ('DMG') and also a dark matter-only resimulation of the 
same initial conditions ('DMO'). We consider both here, giving 
us three different datasets in total: there are 431 galaxy-halo sys- 
tems in the Deason et al. (201 1) data, 99 systems from Bett et al. 
(2010) DMG, and 95 from their DMO simulation. Despite the dif- 
ferences in the physics used in the different simulations, we find 
that a Kolmogorov-Smirnov test fails to show a significant differ- 
ence between the three datasets at a 5% significance level, i.e. they 
are consistent with having been drawn from the same distibution. 

We use a Fisher (1953) distribution averaged over the az- 
imuthal angle tp to characterise the alignment probability given by 
the data. (We describe this distribution in more detail in Appendix 
A.) The probability density function (PDF) is given by 

P(cos#) = /o(/csin#sin$o) exp (/ccos Scos O )> (9) 

2 sinh k 

in terms of the "mean" direction 6 and the concentration k, which 
we write in terms of the distribution width cr = 1/ yfic. (/ is the 
zeroth-order modified Bessel function of the first kind.) We find 
that sufficiently accurate values for the mean direction and width 
arc 

g = 0.0, cr = 0.55. (10) 

We show the three distributions and this fitted PDF in Fig. 4. Note 
that although the preferred direction 8q = 0, the median value of 6 
for this distribution is actually 37.7°. 

We therefore define our third galaxy-halo alignment model as 
the PDF given in equation (9) with the parameters given in equa- 
tion (10), using the halo angular momentum as the reference vec- 
tor v h = /. Note that the distributions in the Parallel and Uniform 
alignment models are limiting cases of equation (9), for cr — > and 
cr — > oo respectively (assuming / and c h are parallel). 



2.3.4 Split distribution 

Finally, we use a model for galaxy-halo alignment that explicitly 
differentiates between different types of galaxies. 

The strong link between the angular momenta of gas and dark 
matter, and the formation of disc galaxies, leads us to link their 
alignment to the halo angular momentum. On the other hand, ellpit- 
ical galaxies are usually considered to have formed through merg- 
ers, which will randomise their orientation (e.g. Scannapieco et al. 



2009; Romano-Diaz et al. 2009; Bett & Frenk 2011, and refer- 
ences therein). However, the galaxy will subsequently accrete fur- 
ther material, which could come from certain directions preferen- 
tially (e.g. along filaments). The same is true for the halo, and 
we can consider the same directed merger and accretion events 
that determine the halo shape to influence the galaxy shape in 
a similar way. This suggests that it is reasonable to set bulge- 
dominated galaxies to be aligned to their halo. Observational stud- 
ies have given evidence for good alignment of early-type galaxies 
and their haloes (e.g. Kochanek 2002, 2006; Cypriano et al. 2004; 
Ferreras, Saha & Buries 2008; but see Okumura, Jing & Li 2009). 

Heavens, Refregier & Heymans (2000) introduced a simple 
model for galaxy-halo alignment, in which elliptical galaxies 
were (implicitly) co-aligned with their halo, and disc galaxies 
were aligned parallel to their halo's angular momentum vec- 
tor. This model was also used in conjunction with the Millen- 
nium Simulation in the work on satellite galaxy alignments of 
Agustsson & Brainerd (2010). Heymans et al. (2004, 2006) ex- 
tended this alignment model to allow for a misalignment distri- 
bution around the halo angular momentum vector, following the 
results from van den Bosch et al. (2002) as already discussed. 

Given the obvious similarities between our alignment models 
and that of Heymans et al. (2004), we construct our Split model in 
the same way. Using the stellar-mass bulge-to-total ratio B/T as a 
physical measure of galaxy morphology, we apply the Fitted align- 
ment model for galaxies with B/T < 0.5 (i.e. using equations (9) 
& (10) to sample 6 with respect to the halo angular momentum, 
for disc-dominated galaxies), and the Parallel alignment model for 
galaxies with B/T > 0.5 (i.e. = with respect to the halo shape, 
for bulge-dominated galaxies). 

2.4 The image plane 

We consider our galaxy-halo systems as lenses, and that, when 
stacked, their mass distributions will be measurable through weak 
lensing of the shapes of background source galaxies. We don't need 
to actually perform the lensing itself, as we are most interested in 
how galaxy-halo alignment affects the projected mass distribution; 
for our purposes, the lensing process would mostly serve to add 
noise to the halo ellipticity signal, making the stacked halo appear 
more circular. 

We assume that the observer will try to align their lens galaxies 
in an image plane such that minor axis of the galaxy is parallel to 
the image y-axis, and the galaxy major axis is parallel to the image 
x-axis 5 . As galaxies will not be exactly edge-on when observed on 
the sky, we have to allow for some variation in inclination angle 
(, which we define as a rotation about the galaxy major axis (the 
image x-axis), such that the image-plane normal vector is rotated 
above/below the galaxy intermediate axis; the galaxy minor axis is 
rotated in front of or behind the image plane, no longer parallel to 
the image y-axis. We sample ( from a uniform distribution over the 
range ±30°. 

Note that, even in the case of perfect galaxy-halo alignment 
in 3-D (6 = 0), the combination of the non-zero azimuthal angle <f>, 
galaxy orientation and image-plane inclination ( results in mis- 
alignment between the projected galaxy and halo, and variation in 

5 In practice for our model, aligning with the galaxy intermediate axis par- 
allel to the image .v axis is equivalent to having ^ = ±90°, i.e. the uniform 
distribution of f already accounts for the uncertainty in differentiating be- 
tween these axes observationally. 
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Figure 4. Galaxy-halo alignment distributions, from Deason et al. 2011 (their fig. 3) and Bett et al. 2010 (their fig. 17). Angles are between the halo and 
galaxy angular momentum vectors. The black curve is the azimuthally-averaged Fisher distribution given by equations (9) & (10). Medians are marked with 
vertical dashed lines. 



the ellipticity of the projected halo itself. Perfect alignment in 3-D 
need not mean perfect alignment in projection. 



2.5 Stacking 

In observations, in order to obtain a measurable signal above the 
noise from single measurements, the shear signal from many galaxy 
images must be stacked, with the resulting shape being that of the 
net mass distribution. In practice, it is sensible to scale the galaxy- 
halo systems to ensure that they are compared fairly, and the signal 
does not become dominated by few very large systems. This might 
be done according to some spatial scale on the galaxy images, an 
assumed mass content, or more directly by luminosity. In our case, 
care must be taken to use an appropriate weighting when summing 
(stacking) the shape tensors of haloes. The shape tensors we use are 
themselves wieghted differently: Ms mp and Ms mp itr scale with halo 
mass and square radius (and are thus their sum is very susceptable 
to dominance by high-mass haloes), whereas M R(iu and M R(iultr just 
scale with halo mass. 

We choose to stack halo shapes weighting by galaxy r-band 
luminosity L r , with respect to some constant reference luminosity 
L r0 . For a given shape tensor definition M, 



*-'r,c 



(ID 



where the sum is over selected galaxy-halo systems <p. The choice 
of L r> o is not important. Since we obtain luminosities from magni- 
tudes, M r0 — M r = -2.5 log l0 (L r0 /L r ), we simply choose M r0 = 
such that L ., /L r = 10 M,/25 . Note that this will not be possible ob- 
servationally if redshift information is not available. Instead, the 
strong weighting of the shape measurement towards large haloes 
would be retained. Even with photometric redshifts, such an obser- 
vational study might choose to calculate the halo shapes in lumi- 
nosity bins, rather than use luminosity to scale the data from each 
lens (Mandelbaum et al. 2006). 



2.6 Summary 

We have four alignment models (Parallel, Uniform, Fitted and 
Split) that define the galaxy-halo alignment angle 6, together with 
random sampling for the azimuthal angle 0, galaxy orientation £ 



about its minor axis, and (over a restricted range) the image plane 
inclination We also use four methods for measuring halo shape 
(by the tensors M Smp , M Sm pi tt , M Rdll and M RdllI(1 ). We are using a 
single algorithm for defining the haloes, and the publicly-available 
results from two semi-analytic galaxy formation models (DLB07 
and B06). 

After assigning values for 9, <p, we rotate the halo shape 
matrix in question into the image plane. We obtain the eigenvalues 
and eigenvectors of its projection onto the image plane, giving us 
the projected halo shape axes a pT and b pl . We can measure the circu- 
larity of the haloes in projection through the axis ratio q pl = b pl /a pr . 

Note that for the Parallel, Uniform and Fit alignment models 
there is no link between galaxy properties and alignment. However, 
since both halo shape and galaxy properties depend on the merger 
history of the halo, it is possible that halo shapes can be correlated 
to galaxy properties: in principle, one could be able to select galax- 
ies that preferentially have less-spherical haloes. 



3 MONTE CARLO TESTS 

To directly test the impact of our alignment models and the halo 
shape distribution on the resulting stacked shapes, we perform 
Monte Carlo experiments to construct a halo-galaxy sample, with- 
out using the simulation or semi-analytic model. 

To generate a halo population, we sample the 3-D axis ratio 
.v from a Gaussian probability distribution based on the results of 
Allgood et al. (2006). We take the standard deviation of the Gaus- 
sian to be cr. = 0. 1 , and take the mean to be 



\M, 



(12) 



where a = 0.54, B = -0.050, and M is the halo mass. The charac- 
teristic mass M,(z) is given by 

log 10 [M,/( /r'M Q )] = A - Slog 10 (l + z) - C (log 10 (l + z)f (13) 

with A = 12.9, B = 2.68, and C = 5.96. For the purposes of these 
tests, we take a constant halo mass M = 10 12 /t'Mq. Using our 
standard redshift of z - 0.32, we obtain M, = 3.09 x 10 12 /t'Mq, 
leading to a distribution with a mean sphericity of (s) = 0.571. For 
each value of s, we assign an intermediate axis ratio of q = b/a = 

(l+j)/2. 
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Figure 5. Shape of stacked, projected haloes generated by sampling their 
3-D shapes from a Gaussian, then sampling their orientation with respect to 
an image plane in the usual way; see text for details. The continuous line 
joins the results from alignment distributions with a range of widths cr (see 
equation 9), with the additional points (the circle and triangle) representing 
the limiting cases of cr — > and cr — » <x> respectively. 



Using this halo shape distribution, we then generate samples 
of projected halo shapes, each comprising 10 6 objects. We generate 
one sample each using the Parallel and Uniform alignment models, 
and a series of samples based on the Fitted model. In the latter case, 
we choose a different value of the alignment distribution width cr 
for each sample. We do not model the halo angular momentum, and 
instead take the alignment distribution to always be with respect to 
the halo shape. We retain the variability in image plane alignment 
of ±30°. 

We stack these projected haloes directly, giving M tol = £ M f ; 
since the haloes are all the same size, we need not (and cannot!) 
weight by galaxy luminosity. 

The results, showing how the resulting stacked halo shape de- 
pends on the alignment distribution width, are shown in Fig. 5. The 
stacked halo shape quickly changes from q pl ss 0.68 for Parallel 
alignment, through q pl > 0.9 for cr > 0.6, and converging to the 
result from the Uniform distribution by cr as 2. 

We have also investigated the joint impact of the original halo 
shape distribution and the alignment distribution width. For this, we 
did not sample halo shapes from a Gaussian, but instead set them 
all to a fixed value s. The orientation distributions were randomly 
sampled in the same way as before, for a grid of values of s and 
cr. The results are shown in Fig. 6. It shows that, as expected, the 
sphericity of the halo population is largely immaterial, unless the 
alignment distribution has cr < 0.5. Even in that case, one needs a 
strongly aspherical shape distribution, with s < 0.3 in order to get 
a stacked shape of q pT < 0.8. 

These Monte Carlo tests have shown quantitatively the sen- 
sitivity of the stacked halo shape on the form of the galaxy-halo 
alignment distribution. Thus we expect that, if galaxies are aligned 
randomly in their haloes, or even if they are aligned as found in 
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Figure 6. The contours and shading show the stacked axis ratio q pI from 
random-sampling halo orientations in the usual manner, from distributions 
with width cr, and haloes with 3-D axis ratio s. The lowest contour (in the 
red region) is at q pr = 0.35; the other contours increase in steps of 0.05. 



recent hydrodynamic simulations, then the stacked halo shape will 
be > 0.9. 



4 RESULTS 

The results for the axis ratios q pI = b p Ja pi of the stacked projected 
halo shapes are shown in Fig. 7 for the B06 model, and Fig. 8 for 
the DLB07 model. Each point represents the result for a given com- 
bination of models, with the different columns showing the effect of 
different selection criteria. We now go on to examine these results 
in detail. 

A quick glance confirms that the primary factor in determining 
the measured stacked halo shape is the galaxy-halo alignment dis- 
tribution. When the Uniform model is used (triangles in the plots), 
the stacking process washes out any intrinsic halo ellipticity, and 
the stacked halo is circular. The maximum deviation from circu- 
larity comes when the Parallel alignment model is applied, as this 
allows the maximal contribution from all haloes towards the final 
shape. 

In the Parallel case, there are significant differences caused 
by the different halo-shape algorithms. When Ms mp is used, the 
result is furthest from circular, with larger axis ratios generated 
when M Rdu is used. The iterative methods give moderated values of 
these extremes: using Msmpitr yields slightly more circular haloes 
than Ms m p, and using M R(illltr yields slightly less circular haloes 
than M Rdu . This difference is due to the different implicit weight- 
ing that these methods give to haloes when stacking. The reduced 
tensors have their dependence on halo size (radius) scaled out, so 
that haloes contribute proportionally to their mass (which we then 
reduce by counterweighting by luminosity). The simple inertia ten- 
sors however retain the additional square-radius dependence. This 
means that the stacked halo results are much more strongly dom- 
inated by high-mass objects in the simple inertia tensor case, but 
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Figure 7. Axis ratios of the resulting projected shape tensors computed from stacking haloes selected according to different criteria. This plot uses our 
standard redshift of z — 0.32 and the B06 semi-analytic model. Each of the four panels gives the results from using a different halo shape algorithm, and each 
symbol type gives the result from different galaxy-halo alignment models. The first column in each panel ("All") gives the results for all selected haloes with 
M < 10 13 /i~'Mq. The next column adds a restriction at low masses, excluding haloes comprising < 1000 particles. The third and fourth columns split the halo 
population from the "All" column by colour and morphology respectively, with the blue symbols representing the blue/disc case, and red symbols representing 
the red/elliptical case. The colour cut is made at rest-frame u — r = 0.9, and the morphology cut is made at B/T = 0.5. 
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Figure 8. As Fig. 7, but using the DLB07 semi-analytic model. The colour-cut is therefore made at observer-frame u - r = 3.5. 
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Table 1. The number of galaxy-halo systems for the different models and selections in Figs. 7 & 8. 



Model 


Shapes 


All 


N p > 1000 


Blue 


Red 


Disc 


Elliptical 




Msmp, MRdu 


7 866537 


443 053 


7 064928 


801 609 


6890492 


976 045 


B06 


Msmpltr 


6454412 


320280 


5 869 870 


584542 


5 738 275 


716137 




MRduItl 


5 830433 


318931 


5 249 670 


580763 


5 1 16 628 


713 805 


DLB07 


Msmp, M Rdll 

Msmpltr 
MRduItl 


10710174 
9034263 
7 887 973 


442 862 
320 177 
318797 


10460421 
8 842 123 
7 707 025 


249 753 
192 140 
180948 


10241493 
8 660 850 
7 542 620 


468681 
373 413 
345 353 



are more evenly weighted in the reduced case. How this effects the 
results depends on how the intrinsic halo shape distribution varies 
with mass for the different algorithms, which we showed earlier in 
Fig. 3: higher mass haloes tend to be less spherical. 

A numerical artefact, present in A'-body simulations such as 
the MS, is that haloes consisting of a small number of particles tend 
appear systematically less spherical than those with many particles. 
A lower limit of around 300 particles was suggested for the MS in 
Bett et al. (2007) to ensure accurate halo shapes. In Figs. 7 and 8, 
we compare the results from all haloes and those with at least 1000 
particles (in all cases, our upper mass limit of M < 10 13 /i~'Mq 
applies). We see that excluding the low-mass haloes makes only 
a very small difference to the stacked result: For the Ms rap and 
Msmpiti algorithms, high-mass haloes dominate the stacking any- 
way, and for MRd u and MRd u i tl the fact that the numerical biasing at 
low masses is in the same direction as the natural trend going to 
high masses leaves negligible net effect. 

If, without numerical constraints, haloes in fact continue 
to get more spherical towards lower masses (as suggested by 
Maccio, Dutton & van den Bosch 2008 and Munoz-Cuartas et al. 
2011), then the effect of retaining lower masses in the stacking 
would be more important: when using the reduced inertia tensor, 
the result for "all" haloes in our figure would be more circular. 

4.1 Split alignment and the morphological mix 

The Split alignment model shows the greatest variation among the 
models and selections tested. Since in this model the galaxy-halo 
alignment depends explicitly on galaxy morphology, the stacked 
results when selecting by morphology are entirely predictable: For 
elliptical galaxies, the result mirrors that from the Parallel align- 
ment model, whereas for disc galaxies it mirrors that of the Fitted 
alignment model. For the other selections, the result depends on 
how the distribution of galaxy morphologies relates to the quantity 
used for selection. 

Even the data for 'All" systems shows significant variation 
between the B06 and DLB07 models, and for different shape algo- 
rithms. Furthermore, excluding low particle-number systems has a 
significant impact in the B06 model, but not in the DLB07 model. 
We therefore need to examine how the galactic morphological mix 
varies with halo mass in the two models. 

This is shown in Fig. 9. We can clearly see that, while both 
semi-analytic models are dominated by disc galaxies at low halo 
masses, they show very different bahaviour for higer-mass haloes. 
In the B06 model, the proportion of galaxies that are discs falls 
rapidly with increasing halo mass, such that the galaxy popula- 
tion at high masses is dominated by elliptical galaxies. In the 
DLB07 model however, the population remains dominated by disc 
galaxies for roughly another decade in mass, and only drops to 
a roughly even spread between discs and ellipticals. (Note that 



Parry, Eke & Frenk (2009) have studied in detail the origin and 
evolution of galaxy morphologies in the B06 and DLB07 models.) 

This is reflected in the results for the stacked halo shape. When 
the Ms m p tensor is used, the stacked shape is weighted towards the 
high-mass haloes. In the B06 model, this means that the majority 
of strongly-contributing haloes host elliptical central galaxies, with 
the Parallel alignment model. There is still a significant number of 
disc galaxies however, and their Fitted alignment model means that 
the net stacked shape is more circular than if the Parallel alignment 
model was used alone. In the DLB07 model, the galaxy morpholo- 
gies are even more mixed, with the additional misalignment result- 
ing in a more circular stacked result. 

On the other hand, if a reduced shape tensor is used, then in 
all cases the haloes are weighted more equally. Those dominant by 
number are at low masses, which are dominated by disc galaxies, 
using the Fitted alignment distribution. Because the steep drop in 
the fraction of disc-dominated galaxies happens at lower masses in 
the B06 model, that model has fewer disc galaxies and results in a 
less circular stacked halo shape. 

When considering how the Split model operates when se- 
lecting systems by galaxy colour, we need to examine the re- 
lationship between colour and morphology. Although they are 
classically seen to correlate well (e.g. de Vaucouleurs 1961; 
Simien & de Vaucouleurs 1986; Strateva et al. 2001; Bell et al. 
2004), both theoretical and observational studies have shown 
the relationship to be not straightforward, and based on de- 
tailed processes occurring during galaxy formation and evolu- 
tion (Crotonetal. 2006; Benson et al. 2007; Deng et al. 2007; 
Guo et al. 201 1 ). We show the relationships between galaxy colour 
and morphology for our models in Figs. 10 & 11. 

In the B06 model (Fig. 10), it is clear that blue galaxies are 
mainly discs, and disc galaxies are mainly blue. Elliptical galaxies 
have a broader range of colours, albeit dominated by red galax- 
ies. This means that, in the stacked halo results, selecting just 
blue galaxies yields a much more circular shape than selecting just 
red galaxies, although the mixing between colour and morphology 
means that the situation in both cases is less extreme than when 
selecting by morphology directly. 

For the DLB07 model (Fig. 11), the population remains domi- 
nated by disc galaxies when either blue or red galaxies are selected, 
although it is to a lesser extent in the red case. Similarly, both the 
disc galaxy and elliptical galaxy populations are dominated by blue 
galaxies, with the elliptical galaxy population having a significant 
red galaxy presence too. This results in a much smaller difference 
between the stacked halo shapes when selecting just red and just 
blue galaxies, when compared to the results from the B06 model; in 
particular, the result for the red population is significantly more cir- 
cular. However, the tight link between blue and disc galaxies means 
that for both the B06 and DLB07 model, selecting blue galaxies 
yields the more circular stacked halo. 
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Figure 9. Distribution of galaxy morphologies as a function of halo mass, for the B06 and DLB07 semi-analytic models. The left panel shows the proportion 
of galaxies at each mass that have B/T < 0.5 (i.e. discs), with Poisson error bars. The middle and right panels show the (normalised) histograms of the different 
galaxy samples. Note that the plots extend beyond the upper mass limit of 10 13 h~ i MQ that we use in our analysis, and below the lower limit of 1000 particles 
that we consider; both limits are marked with vertical dashed lines. 
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Figure 10. Distribution of galaxy morphologies as a function of colour, 
for the B06 model. The top panel gives the fraction at each colour that are 
disc-dominated, with Poisson error bars. The bottom panel shows colour 
histograms for each population sample. The colour cut at rest-frame u — r — 
0.9 is marked with a vertical dashed line. 
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Figure 11. As Fig. 10, but for the DLB07 semi-analytic model; colours are 
therefore observer-frame. The colour-cut is marked in the lower panel at 
u-r = 3.5. 
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Figure 12. Joint distribution of 3-D halo shapes and the colours of their 
central galaxy. Shapes are computed using the the Ms mp tensor (left pan- 
els) and the Mr,j u tensor (right). Top row: Results from the B06 model, us- 
ing rest-frame colours. Bottom row: Results from the DLB07 model, using 
observer-frame colours. The vertical dashed line marks the red-blue divi- 
sion in both cases, and the horizontal solid lines mark the medians for red 
and blue galaxies separately. 

4.2 Parallel & Fitted alignment, and galaxy-halo 
correlations 

The results in Figs. 7 & 8 for the Parallel model (and to a lesser ex- 
tent the Fitted model) also depend on the colour and morphological 
selection, with red or elliptical galaxies resulting in a more circular 
stacked halo than blue or disc galaxies. In this case, the difference 
is not due to the alignment model, but an intrinsic correlation be- 
tween galaxy colour/morphology and halo shape. 

Figure 12 shows the joint distribution of 3-D halo sphericity 5 
from the M Smp and M Rdu tensors, against u-r colour for the B06 and 
DLB07 semi-analytic models. The projected halo shapes in these 
cases, for the Parallel alignment model, are shown against colour 
in Fig. 13. We can see that the colour distribution is very broad for 
any given halo shape, although the haloes of blue galaxies have a 
less-spherical median shape than those of the red galaxies. Sim- 
ilarly, we plot projected axis ratio histograms in Fig. 14, cut by 
galaxy morphology: ellpitical galaxies have slightly more circular 
projected haloes in the median than disc galaxies. It is important to 
note that the median shape in a distribution is not the same as the 
stacked halo shape from the same sample of haloes, because the 
stacking process weights haloes differently. 

While most of the results for Parallel alignment are very much 
consistent between the B06 and DLB07 models, the projected 
shapes for blue (and disc-dominated) galaxies using the Ms mp ten- 
sor are noticeably less circular for the B06 model. This is again due 
to the M Smp shapes being dominated by the very largest haloes: In 
the B06 model, there are very few large haloes hosting blue cen- 
tral galaxies (just 242 systems with masses 10' 25 -10 13 /t'Mq), 
whereas with the DLB07 model the blue population extends to 
much higher masses (in the same mass bin, there are 29 905 
haloes). We show the distributions of galaxies of different colours, 
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Figure 13. As Fig. 12, but using the projected halo shapes, assuming the 
Parallel alignment model. 
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Figure 14. Histograms of projected halo shapes, assuming the Parallel 
alignment distribution, divided according to galaxy morphology: the dis- 
tribution for elliptical galaxies is shown in red, and that of disc galaxies is 
shown in blue. The result from the full population is shown in black, but 
this closely matches that of the disc sample. As in Figs. 12 & 13, we show 
the results from both semi-analytic galaxy models, and the Ms mp & Mr^u 
shape measurement tensors. Medians for each sample are shown by vertical 
dashed lines. 
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Figure 15. Joint distributions of projected halo axis ratio (assuming the Par- 
allel distribution and using the Msmp tensor), with halo mass. Each panel 
shows the results from both semi-analytic models, with the upper panel 
showing the blue population, and the lower panel showing the red popu- 
lation. The solid line joins points giving the median in mass bins, with error 
bars given by the standard error on the median (equation 7). The same con- 
tour levels are used for both models in both panels. 



as functions of their parent halo mass, in Fig. 15. The figure shows 
clearly how the highest-mass haloes (up to 10 13 /t'Mq) hosting 
blue galaxies in the B06 model have a lower median projected axis 
ratio than those in the DLB07 model. Furthermore, despite the sig- 
nificant differences in the distributions of red galaxies between the 
B06 and DLB07 models, the medians as a function of mass are very 
similar. 

It is important to note that the correlation between galaxy 
properties and halo shape is relatively weak, and (as shown from the 
Monte Carlo tests earlier, Fig. 6) only plays a role when the impact 
of galaxy-halo misalignment is strongly reduced. Applying the Fit- 
ted distribution provides an example of an intermediate case: while 
there is still some variation between the results for red/elliptical 
and blue/disc galaxies, the circularising effect of galaxy-halo mis- 
alignment means that these differences are negligable. Indeed, the 
stacked halo results are sufficiently circular under the Fitted distri- 
bution (q pl > 0.95) that it would be extremely difficult for a weak 
lensing study to measure any significant ellipticity. 



5 DISCUSSION AND CONCLUSIONS 

Natarajan & Refregier (2000) and Brainerd & Wright (2000) de- 
scribed a technique for measuring halo ellipticity using the az- 
imuthal variation in the tangential shear signal. Since this is a weak 
variation on top of the already weak shear signal, any measurement 
would require very large numbers of galaxies, from large surveys. 
In this paper, we have shown quantitatively, for a range of possi- 
ble models, that a measurement of anisotropy would be extremely 
difficult indeed. For example, using a model based on current hy- 
drodynamic galaxy formation simulations yields stacked halo axis 
ratios of q pl > 0.95, which would be extremely difficult to observe. 

The main problem, as expected, is the galaxy-halo misalign- 
ment. Any intrinsic correlations between halo shape and galaxy 
properties are only relevant in the case of unrealistically perfect 
alignment. While this can be seen in our main results from using 
the Millennium Simulation, it is very clearly demonstrated in just 
using simple Monte Carlo tests. 

It is important to note, when considering observations related 
to galaxy-halo alignment, that the physically relevant angle 6, as 
measured from simulations, is the alignment in 3-D. Due to the 
random orientations of the other angles in the system (galaxy minor 
axis polar angle <f>, galaxy orientation £ and image plane inclination 
f), even parallel alignment with 6 = does not necessarily lead 
to perfect alignment of the observationally relevant angle, between 
the projected axes. 

In both of the semi-analytic galaxy formation models we test 
here, we find that blue or disc-dominated galaxies tend to reside in 
less-spherical haloes. However, if we assume that elliptical galaxies 
are aligned more closely to their halo than disc galaxies (follow- 
ing e.g. Heymans et al. 2004), then it is selecting red or elliptical 
galaxies that yields a more elliptical stacked halo in projection. In 
our work, this of course occurs by construction, and further work on 
galaxy-halo alignment in simulations is required to see how plausi- 
ble this is in practice. However, the alignment distributions we use 
span the range of possibilities, and, in conjunction with our Monte 
Carlo tests and the colour/morphology distributions of modelled 
galaxies, the result of any arbitrary alignment distribution can be 
estimated. 

The work presented in this paper can be seen as a dark matter- 
theoretical counterpart to the paper of Howell & Brainerd (2010), 
who performed a thorough study of difficulties with measuring 
anisotropic shear from a lensing-theoretical standpoint. Our distri- 
butions for the projected halo axis ratio (Figs. 12-15) derive from 
more complex modelling for example, and show the possible vari- 
ability due to galaxy type, but Howell & Brainerd (2010) take a 
given distribution through a Monte Carlo lensing process, demon- 
strating that multiple deflections of background galaxies can also 
have a catastrophic effect on the anisotropic shear signal. 

Our results may lead one to wonder how it was that both 
Hoekstra, Yee & Gladders (2004) and Parker et al. (2007) managed 
to claim a measurement of halo ellipticity. It should first be men- 
tioned that their detections are relatively weak anyway, with their 
halo ellipticity measurements being at the l-2<r level (although 
Hoekstra, Yee & Gladders 2004 rule out spherical haloes at the 3cr 
level). Possible sources of the discrepancy include underestimation 
of systematic errors in the modelling and interpretation of the data, 
not helped by the lack of redshift and colour/morphology data in 
both of these studies. Mandelbaum et al. (2006) discuss in some 
detail the discrepancy between their non-detection and the results 
of Hoekstra, Yee & Gladders (2004). It is clear that, regardless of 
the physics-related problems highlighted in the present work, con- 
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trolling systematics in observational studies such as these is very 
difficult. 

We have already mentioned the difficulty in using this method 
as a way of distinguishing between ACDM and modified gravity 
theories. The basic idea - ACDM predicts non-spherical haloes, 
but modifications to gravity without dark matter predict spherical 
symmetry - is based on a naive understanding of practical issues in 
both the ACDM and modified gravity case. With ACDM, system- 
atic effects in the lensing and stacking procedure can easily render 
the net signal isotropic, and as we have shown, our poor knowledge 
of the relationship (and in particular the alignment) between galax- 
ies and their parent haloes provides much of the uncertainty. In the 
case of galaxies in modified gravity theories, all the aforementioned 
problems with our understanding of the baryonic physics still ap- 
ply, but in the context of gravity laws that are more complex and 
less well understood. There has been some simulation work with 
gas dynamics in MOND (Tiret & Combes 2008), but no full simu- 
lation of galaxy formation in a cosmological context, with any star 
formation or feedback (simulations in ACDM have shown how sig- 
nificant an impact these processes have on the resulting galaxies). 
In the context of STVG, the gravity law is more difficult to work 
with, and numerical simulations are still in their infancy (see e.g. 
Moffat & Toth 2010). Thus, we do not believe that statistical anal- 
ysis of stacked, projected lens galaxies can be used to discriminate 
between ACDM and alternative theories, simply because we lack 
robust predictions from either case. 

Consequently, in this paper, we have not gone as far as to 
make a prediction for observations, as the theoretical uncertainty 
is still too large. In the future however, if the galaxy formation 
models reach better convergence and can offer statistical predic- 
tions of galaxy-halo alignment, then a study such as ours could be 
advanced further to make such an observational prediction. In that 
case, certain other effects would need to be taken in to account. We 
have been able to neglect these here, as they are all secondary to 
the main misalignment difficulty. 

Firstly, when computing shapes from simulations to compare 
with observations, then it would be more appropriate to use the 
mass within a given radius. This is (arguably) not the same as the 
shape of the halo, which is a dynamically relaxed physical struc- 
ture, rather than a geometrically-defined overdense region. How- 
ever, an observation such as this has no practical way of accessing 
the dynamical information necessary to define a halo, and so there 
is no need to do so in simulations for this purpose either 6 . Ideally, 
the mass distribution as a function of radius would be generated 
from the simulations, as it has been shown that halo ellipticity is 
not constant with radius. The distinction between central and satel- 
lite galaxies could also be relaxed, and the shape of the mass dis- 
tribution around each (lens) galaxy could be computed, allowing 
for selection criteria that more closely match those in observational 
studies. (The problem of which galaxy is at the centre of a halo is 
not limited to simulations; for example, Skibba et al. (2011) have 
shown that the brightest galaxies in haloes are often not in the cen- 
tre, and this should be taken into account when simulating observa- 
tions.) 

When considering the inner halo shape however, it be- 
comes vital to consider the impact of baryonic processes. 



6 It should be noted that in analyses of simulations alone, the distinct ideas 
of measuring the shape of an overdensity contour colocated on the same 
density peak as a halo, and the shape of the dynamically-defined halo itself 
are often conflated. 



While dark matter-only haloes are triaxial with a ten- 
dency for prolateness, becoming more prolate towards the 
centre (e.g. Bett et al. 2007; Hayashi, Navarro & Springel 
2007), haloes that have had a galaxy form in the centre 
are overall more spherical, with a tendency towards oblate- 
ness (Kazantzidis et al, 2004; Kazantzidis, Abadi & Navarro 
2010; Baffin et al. 2005; Berentzen & Shlosman 2006; 
Gustafsson, Fairbairn & Sommer-Larsen 2006; Debattista et al. 
2008; Tisseraetal. 2010; Machado & Athanassoula 2010; 
Abadi et al. 2010; Lau et al. 2011). This is likely to make halo 
shapes more difficult to measure. On the other hand, strong 
lensing studies have suggested that mass and light are well 
aligned in the inner regions of the halo (Kochanek 2002, 2006; 
Minor & Kaplinghat 2008). 

It would be important to measure the alignment distribution 
from a statistically large sample of objects, and over a range of 
time steps. It is known that both halo and galaxy orientations vary 
in time even outside major mergers (e.g. Scannapieco et al. 2009; 
Romano-Diaz et al. 2009; Bett 2010; Bett & Frenk 2011), so the 
relative orientations of a few galaxies and haloes at a single redshift 
might not be at all robust, regardless of how well-resolved they are 
spatially, or how realistic the baryonic physics in the simulation is. 

Eventually, realistic mock-observations would need to be pro- 
duced, using ray tracing through the simulation (e.g. Hilbert et al. 
2009) so that a realistic background source population and the ef- 
fect of multiple deflections are included, as it has been shown that 
these have a significant impact on shear measurements (Brainerd 
2010; Howell & Brainerd 2010; Hoekstra et al. 201 1). 

While our results do not seem to give much cause for opti- 
mism in measuring shapes using weak lensing, it should be pointed 
out that we are only considering one method, for the shape dis- 
tribution of non-cluster haloes. Many other methods of measur- 
ing halo shapes are possible, and indeed are actively persued. Fur- 
thermore, the ideal test of ACDM is to measure the inceasing as- 
phericity of haloes with increasing mass, and thus the shapes of 
cluster haloes are particularly important. Evans & Bridle (2009) 
applied essentially the same technique as Natarajan & Refregier 
(2000), but on clusters rather than field galaxies. They managed 
to measure a projected halo axis ratio of q pT = 0.48J^y 9 14 with 
lcr errors, ruling out a circular shape at 99.6% confidence. Us- 
ing clusters has the practical advantage that the cluster member 
galaxies can be used for alignment. Much work has been done 
on the alignment of the satellite galaxy distribution, both for mea- 
suring cluster halo shapes and as another test of dark matter, ob- 
servationally and in simulations (e.g. Kuhlen, Diemand & Madau 
2007; Faltenbacher et al. 2008; Libeskind et al. 2009; Knebe et al. 
2010; Yang et al. 2006; Wang et al. 2008; Agustsson & Brainerd 
2010, and references therein). Hopkins, Bahcall & Bode (2005) 
also investigated the distribution of projected halo shapes, for sim- 
ulated cluster-mass haloes as a function of redshift. Other cluster- 
based lensing methods attempt to map the shape directly (e.g. 
Cypriano et al. 2004; Ferreras, Saha & Buries 2008; Oguri et al. 
2010; Debetal. 2010, in the latter case finding q pl = 0.54 ± 
0.04 at lcr), or use Markov Chain Monte Carlo methods to 
fit triaxial models (Corless, King & Clowe 2009). Lensing flex- 
ion has recently been proposed as another method for studying 
galaxy-scale haloes (Er & Schneider 2011; Eretal. 2011). Non- 
lensing methods for studying halo shape include studying the 
distribution of Hi in disk galaxies (e.g. Banerjee & Jog 2008; 
O'Brien, Freeman & van der Kruit 2010, and references therein). 

In this paper, we have presented a quantitative analysis of the 
impact of galaxy-halo misalignment on the possiblity of measuring 
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halo shapes via weak lensing in stacked images. We have tested a 
series of alignment models, spanning the range from perfect align- 
ment (in 3-D) to uniformly-distributed alignment. As intermediate 
models, we included a fit to recent hydrodynamic simulations of 
galaxy formation, and a distribution that explicitly differentiates be- 
tween galaxy mophologies. Our results have shown that, for there 
to be a reasonable possiblity for shapes to be measured, a signifi- 
cant fraction of the lens galaxies must have close to perfect align- 
ment, which seems physically implausible. Using simple Monte 
Carlo models, we have quantified how well-aligned the galaxies 
have to be in their haloes before the intrinsic shape distribution be- 
comes measurable. For our results using the Millennium Simula- 
tion, we have also tested the impact of using different models of 
galaxy formation, and different ways of measuring haloes in simu- 
lations. These illustrate some of the difficulties in applying results 
from current simulations directly to models: there simply is not a 
single robust quantitative prediction from ACDM for halo shape 
measurements using this method. Since the same is true for alter- 
native theories without dark matter, this method cannot yet be used 
to falsify one or the other. 
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APPENDIX A: THE AZIMUTHALLY- AVERAGED FISHER 
DISTRIBUTION 

The probability distribution we use in section 2.3.3 to define our 
Fitted alignment distribution is based on the Fisher (1953) distribu- 
tion, 



of the more general von Mises-Fisher family of distributions, and 
is often used as more mathematically tractable approximation to 
a wrapped Normal distribution (see Mardia & Jupp 2000 for more 
details). 

If we write our vectors in a cartesian basis in terms of spherical 
polar coordinates, and (for our case) take the random variable v to 
be the galaxy axis c ea i, oriented with respect to the halo vector v h 
located on the z-axis, we can write 



sin 9 cos < 
sin 9 sin (f 
cos 9 



sin 6>o cos (f>o 
sin 9 sin <p 
cos 9 



(A2) 



Normalisation of the PDF is over the surface of the unit sphere 



of 9 and <f> as: 



= An), so we can write the PDF in terms 



An sinh k 

such that the normalisation integral is 



o Jo 



Pf(9,(f) d(f>d9=l. 



(A3) 



(A4) 



However, in our case, we are only interested in the angle 9 
between our two vectors, so we have to integrate the Fisher distri- 
bution over all values of the azimuthal angle cf>. The ^-integral is in 
fact related to the zeroth-order modified Bessel function of the first 
kind, Iq(x): 

exp [k sin 9 sin 9 cos(0 - tp j] d</> = 2kI (k sin 9 sin 9 Q ). (A5) 

o 

so we can write our azimufhally-averaged Fisher distribution as 



P(9): 



2 sinh k 



Zo(Ksin$sin#o) exp (k cos 9 cos 9 ) sin#. (A6) 



To aid comparison with the uniform distribution, we shall actually 
normalise in cos 9 instead of 9, so the final PDF that we use is: 



P(cos6») : 



I (k sin 9 sin 9 ) exp(/ecos#cos6>o) , 



(A7) 



2 sinh k 
which is equation (9). 

Note that for large widths (cr > 10), the PDF tends to the 
uniform distribution, and for narrow widths the PDF tends to a delta 
function spike at 9 . 

The routine we use to sample from this distribution is based 
on that given in the Prob library 7 of John Burkardt, which in turn 
is based on Fisher, Lewis & Embleton (2003). 



APPENDIX B: ROTATION AND PROJECTION 

Here, for completeness, we give further details of our method for 
defining the rotations and projection involved in our galaxy-halo 
alignment model described in section 2.3. Consider a reference 
frame S, and a second frame S' that is a rotation of S. If we 
write the cartesian basis vectors of 5 ' in terms of those of S , e.g. 
x = (xi , X2, xt,) 1 , then we can simply write down the rotation matrix 
that transforms from S to S': 



P f (v;v ,k) ■■ 



sinh*: 



exp {k v ■ v ) , 



(Al) 



where the probability density function (PDF) is given in terms of 
the unit vector random variable v, the mean direction unit vector 
v , and the concentration k; the latter is often written in terms of the 
width of the distribution cr through k = 1/cr 2 . This is the 3-D case 



X\ Xi X3 

h h S3 
Zi zi h 



(Bl) 



7 http: //people. sc. fsu. edu/~j burkardt/cpp_src/prob/prob. html 



WL & Halo shapes 21 



Thus, given the basis vectors for the halo coordinate frame in 
terms of the simulation coordinates, given in equation (8), we can 
write down the rotation matrix for transforming from the simulation 
coordinates into these halo-based coordinates: 



cos 6>h cos 0h 
■ sin 0h 
sin 9 h cos 4>h 



cos #h sin fa 

COS 0„ 

sin 9 h sin h 



- sin tfh 


cos 9 h 



(B2) 



In this halo frame, we define the orientation of the galaxy by 
first specifying a minor axis vector direction given by 9 and qb. We 
can thus rotate into a similar galaxy-vector-based frame using a 
matrix R ga j identical in form to equation (B2). However, the ori- 
entation of the galaxy about its minor axis must also be specified, 
by a further rotation by the angle Defining £ such that setting 
£ = makes R aa i have the same form as R„, means that we can 
write down the full rotation matrix as 



CgS^C^ + C^S/: 
-CgS,/,Sg + CfiCf 



SgC f 
SgSf 
Cg 



(B3) 



where we have used c x and s x as shorthands for cos X and sin X, 
for brevity. 

The image plane (section 2.4) is based on a projection of the 
galaxy major and minor axes, corresponding to the image x and 
y axes respectively, but also allowing for a rotation of £ about the 
major axis. This means that, to rotate into the image frame from the 
galaxy frame, we use 



1 
-sinf cosf 
-cos<f -sin£ , 



(B4) 



Putting these together, a vector in the simulation frame p sj 
can be transformed into the image frame simply by 



Rirag Rgal RhP" 



(B5) 



In practice, we will have the halo mass distribution matrix M (see 
section 2.2), measured in the simulation frame. We therefore trans- 
form this into the image frame by 



M" 



(RimgRgalRh) M (RimgRgalRh) 



(B6) 



The projected mass distribution in the image plane is then the top- 
left 2x2 submatrix of M lmg . The eigenvalues and eigenvectors of 
this can then easily be found, with the axis lengths of the projected 
halo ellipse being given by the square root of the eigenvalues. 



APPENDIX C: RESULTS FROM OTHER REDSHIFTS 

We show here stacked projected halo shapes at different redshifts. 
They show essentially the same dependences on halo and galaxy 
properties as have already been illustrated, but nevertheless give an 
idea of another parameter that can have a significant quantitative 
impact on the results. 

We choose z — 0.50 (MS snapshot 48) as a higher redshift 
for analysis. The results for the B06 and DLB07 models are shown 
in Figs. CI and C2 respectively. There is very little change from 
the z — 0.32 results in Figs. 7 & 8. When the Split distribution is 
used, the results for "All" haloes are noticably less circular, imply- 
ing a greater proportion of elliptical galaxies. This might initially be 
seem to contradict the findings of Parry, Eke & Frenk (2009): they 
showed that, at higher redshifts there should be more disc galaxies, 
and fewer ellipticals. However, they found that this is strongest for 
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Figure CI. As Fig. 7 (i.e. using B06), but using redshift z — 0.5. 
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Figure C2. As Fig. 8 (i.e. using DLB07), but using redshift z = 0.5. 



the DLB07 model (where we see less change) and very weak for 
B06 (where we see the largest difference). The key is that while 
Parry, Eke & Frenk (2009) select bright galaxies at each redshift, 
selecting by K-band absolute magnitude M K - 5 log 10 h < -22.17, 
we are using a deeper cut in apparent magnitude: at this redshift, we 
are selecting galaxies brighter than M r - 51og 10 /i = -17.3. Thus, 
we are sampling more low-mass systems than Parry, Eke & Frenk 
(2009) (at all redshifts), but fewer than we were at z — 0.32. Our 
sample has proportionally more higher-mass systems (the biggest 
difference in shape is with M Smp ), which tend to host elliptical 
galaxies. The changes among the bright galaxy population seen in 
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Figure C3. As Fig. 7 (i.e. using B06), but using redshift z ss 0.17. 
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that a colour cut is no longer an efficient way of selecting the more 
aspherical haloes. 

Taken together, these results emphasise those from the main 
body of the paper. Under the alignment model designed to fit recent 
galaxy formation simulations, the alignment is sufficiently poor that 
very little changes the result. However, if there is a significant pop- 
ulation with very good alignement, then the resulting stacked shape 
will depend sensitively on the mass, shape, colour and morphology 
distribution of the galaxy-halo systems; all of which depends on 
redshift, and, at present, the galaxy model used. 



Figure C4. As Fig. 8 (i.e. using DLB07), but using redshift z - 0.17. 



Parry, Eke & Frenk (2009) are secondary to the overall change in 
galaxy demographic at higher redshift. 

Figs. C3 & C4 show our results at a lower redshift, z — 0.17 
(MS snapshot 56; here, our selection cut is at M r — 51og 10 /i = 
-14.6). In this case, the results from the B06 model show a not- 
icably less circular shape for blue and disc galaxies. Since this is 
only significant for the Parallel alignment model, it must be due to a 
greater intrinsic correlation between elliptical haloes and blue/disc- 
dominated galaxies at this redshift. Furthermore, the effect is not 
apparent in the DLB07 model. The colour distribution in DLB07 
is much broader, particularly for blue galaxies (see Fig. 2), such 



